Accuracy versus interpretability in exible modeling : implementing a tradeo using Gaussian process modelsTony
نویسنده
چکیده
One of the widely acknowledged drawbacks of exible statistical models is that the tted models are often extremely diicult to interpret. However, if exible models are constrained to be additive the tted models are much easier to interpret, as each input can be considered independently. The problem with additive models is that they cannot provide an accurate model if the phenomenon being modeled is not additive. This paper shows that a tradeoo between accuracy and additivity can be implemented easily in Gaussian process models, which are a type of exible model closely related to feedforward neural networks. One can t a series of Gaussian process models that begins with the completely exible and are constrained to be progressively more additive, and thus progressively more interpretable. Observations of how the degree of non-additivity and the test error change as the models become more additive give insight into the importance of interactions in a particular model. Fitted models in the series can also be interpreted graphically with a technique for visualizing the eeects of inputs in non-additive models that was adapted from plots for generalized additive models. This visualization technique shows the overall eeects of diierent inputs and also shows which inputs are involved in interactions and how strong those interactions are.
منابع مشابه
Sparse inverse kernel Gaussian Process regression
Regression problems on massive data sets are ubiquitous in many application domains including the Internet, earth and space sciences, and finances. Gaussian Process regression is a popular technique for modeling the input-output relations of a set of variables under the assumption that the weight vector has a Gaussian prior. However, it is challenging to apply Gaussian Process regression to lar...
متن کاملSparse Inverse Gaussian Process Regression with Application to Climate Network Discovery
Regression problems on massive data sets are ubiquitous in many application domains including the Internet, earth and space sciences, and finances. Gaussian Process regression is a popular technique for modeling the input-output relations of a set of variables under the assumption that the weight vector has a Gaussian prior. However, it is challenging to apply Gaussian Process regression to lar...
متن کاملBayesian Nonlinear Support Vector Machines and Discriminative Factor Modeling
A new Bayesian formulation is developed for nonlinear support vector machines (SVMs), based on a Gaussian process and with the SVM hinge loss expressed as a scaled mixture of normals. We then integrate the Bayesian SVM into a factor model, in which feature learning and nonlinear classifier design are performed jointly; almost all previous work on such discriminative feature learning has assumed...
متن کاملSECURING INTERPRETABILITY OF FUZZY MODELS FOR MODELING NONLINEAR MIMO SYSTEMS USING A HYBRID OF EVOLUTIONARY ALGORITHMS
In this study, a Multi-Objective Genetic Algorithm (MOGA) is utilized to extract interpretable and compact fuzzy rule bases for modeling nonlinear Multi-input Multi-output (MIMO) systems. In the process of non- linear system identi cation, structure selection, parameter estimation, model performance and model validation are important objectives. Furthermore, se- curing low-level and high-level ...
متن کاملOn the Capability of a Fuzzy Inference System with Improved Interpretability
Many studies on modeling of fuzzy inference systems have been made. The issue of these studies is to construct automatically fuzzy systems with interpretability and accuracy from learning data based on meta-heuristic methods. Since accuracy and interpretability are contradicting issues, there are some disadvantages for self-tuning method. Obvious drawbacks of the method are lack of interpretabi...
متن کامل